PhEDEx high-throughput data transfer management system

نویسنده

  • J. Rehn
چکیده

Distributed data management at LHC scales is a staggering task, accompanied by equally challenging practical management issues with storage systems and widearea networks. CMS data transfer management system, PhEDEx, is designed to handle this task with minimum operator effort, automating the workflows from large scale distribution of HEP experiment datasets down to reliable and scalable transfers of individual files over frequently unreliable infrastructure. Over the last year PhEDEx has matured to the point of handling virtually all CMS production data transfers. CMS pushes equally its own components to perform and the heavy investment into peer projects at all levels, from technical details to grid standards to worldwide projects, to ensure the end-to-end service is of sufficient quality. We present the throughput and service quality we have reached in the current daily 24/7 production work, the steps taken in LCG service challenges for the next generation transfer service, and the resulting changes in performance. We also report results from our scalability stress tests on PhEDEx alone. We offer an analysis of transfer-related problems we have encountered and how they have been affecting CMS data management.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Virtual Circuits in PhEDEx, an update from the ANSE project

The ANSE project has been working with the CMS and ATLAS experiments to bring network awareness into their middleware stacks. For CMS, this means enabling control of virtual network circuits in PhEDEx, the CMS data-transfer management system. PhEDEx orchestrates the transfer of data around the CMS experiment to the tune of 1 PB per week spread over about 70 sites. The goal of ANSE is to improve...

متن کامل

PhEDEx – The Main Component in Data Management System for the CMS Experiment

The CMS experiment along with other major LHC experiments produces enormous amounts of data that need to be managed at the level of storage systems and distributed computing resources. Physics Experiment Data Export (PhEDEx) has been developed as a data management system for the CMS experiment and handled the assigned tasks very well so far. We present a short overview of this system pointing o...

متن کامل

Throughput Maximization for Multi-Slot Data Transmission via Two-Hop DF SWIPT-Based UAV System

In this paper, an unmanned aerial vehicle (UAV) assisted cooperative communication system is studied, wherein a source transmits information to the destination through an energy harvesting decode-and-forward UAV. It is assumed that the UAV can freely move in between the source-destination pair to set up line of sight communications with the both nodes. Since the battery of the UAV may be limite...

متن کامل

The CMS PhEDEx System: a Novel Approach to Robust Grid Data Distribution

The CMS experiment has taken a novel approach to Grid data distribution. Instead of having a central processing component making global decisions on replica allocation, CMS has a data management layer composed of a series of collaborating agents; the agents are persistent, stateless processes which manage specific parts of replication operations at each site in the distribution network. The age...

متن کامل

Analysis of high-throughput plant image data with the information system IAP

This work presents a sophisticated information system, the Integrated Analysis Platform (IAP), an approach supporting large-scale image analysis for different species and imaging systems. In its current form, IAP supports the investigation of Maize, Barley and Arabidopsis plants based on images obtained in different spectra. Several components of the IAP system, which are described in this work...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006